Notes
- In minitax, reads were NOT filtered for MAPQ
- In minitax, results were NOT normalized to Genome Size
- Unclassified reads were NOT excluded
- Eukaryotes were NOT excluded from the analyis
- The Gold Standard (Theoretical composition) was: Zymo D3600
- The taxonomic lineage of the Gold Standard was taken from NCBI
Zymo Gold Standard
composition
Limosilactobacillus fermentum in the theoretical composition
was changed to
Lactobacillus fermentum
Detection Statistics
based on Taxa presence/absence on different levels
- Precision= true positives /(true positives + false positives)
- Recall= true positives /(true positives + false negatives)
- F1=(2∗ precision ∗ recall)/(precision + recall)
- F0.5=((1+0.52)∗ precision ∗ recall)/((0.52∗ precision)+ recall)
Species-level

Genus-level

Relative abundance of
taxa at each taxonomic level
Phylum level

Order level

Genus level

Species level

Correlations
TIDY UP THE CODE HERE!
The correlations between the theoretical and observed composition are
shown.
Phylum level

Order level

Genus level

Species level

Summarised r2
values

Chi-square tests
Is the observed distribution significantly different from the
theoretical?
Species level

Genus level
